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1.  Introduction 

This  paper  is  concerned  with  the  acquisition  of  the  lexi¬ 
cal.  In  particular,  we  prc^se  a  method  that  uses  ana¬ 
logical  reasoning  to  hypothesize  new  polysemous  word 
senses.  This  method  is  one  of  a  number  of  knowledge 
acquisition  devices  to  be  included  in  DIRC  (Domain 
Indq}endent  Retargetable  Consultant).  DIRC  is  a  kind 
of  intelligent,  natural  language-capable  consultant  kit 
that  can  be  retargeted  at  different  domains.  DIRC  is 
essentially  “empty-UC”  (UNIX  Consultant,  Wilensky 
et  al.,  1988).  DIRC  is  to  include  the  language  and  rea¬ 
soning  mechanisms  of  UC,  plus  a  large  grammar  and  a 
general  lexicon.  The  user  must  then  add  domain 
knowledge,  user  knowledge  and  lexical  knowledge  fa* 
the  area  of  interest 


2.  Previous  Work  in  Acquisition  of  the  Lexicon. 

There  have  been  numerous  attempts  to  build  systems 
that  automatically  acquire  wcxd  meanings.  Mostly, 
these  have  been  either  dictionary  readers  or  attempts  to 
hypothesize  meanings  of  completely  unfamiliar  words 
from  context  (e.g..  Selfridge  (1982),  Granger  (1977)). 
In  contrast,  we  have  focussed  on  the  problem  of  acquir¬ 
ing  word  senses  that  are  related  to  ones  already  known. 

2.1.  A  Note  on  Word  Senses 

For  the  purposes  at  hand,  we  are  only  concerned  with 
word  senses  that  are  synchronically  related.  These  may 
be  polysemous  senses  of  individuals  words,  as  well  as 
related  senses  of  different  words.  In  addition,  we  distin¬ 
guish  meanings  or  conception  structures  of  a  word  from 
senses.  (We  will  use  the  term  "meaning”  and  "con- 
cq>tual  structure”  interchangely  in  this  context)  A  sin- 
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gle  meaning  may  be  involved  in  any  number  of  senses, 
each  of  which  has  grammatical  or  other  differences. 
Typically,  a  word  has  at  least  one  core  meaning  from 
which  the  meanings  involved  in  other  senses  are  in  some 
sense  synchronically  based. 

For  example,  the  word  “open”  has  adjectival  and  verbal 
senses;  the  verbal  senses  include  some  whose  meaning 
is,  roughly,  making  physical  access  available  to  an 
enclosed  region  by  moving  some  object  (e.g.,  "open  a 
jar”,  “open  a  draw”,  “open  the  door”).  This  is  prob¬ 
ably  a  core  meaning  of  the  word.  There  are  several 
senses  involving  this  meaning,  just  among  the  verbal 
senses.  These  senses  are  differentiated  from  one  another 
by  how  the  components  of  the  meaning  relate  to  the 
verb’s  valence.  For  example,  one  sense  has  the  object 
moved  as  the  patient,  and  hence  as  the  direct  object  of 
the  transitive  verb  (as  in  “open  the  door”);  another  uses 
the  container  itself  as  the  direct  object  (e.g.,  “open  the 
jar”);  perhaps  another  involves  some  sort  of  aperture 
that  widens  (e.g.,  “open  your  throat”  or  “open  the  pupil 
of  your  eye”).  Additionally,  each  of  these  components 
of  the  meaning  can  be  realized  as  patients  by  appearing 
as  the  subject  of  the  intransitive  version  of  the  verb.  We 
consider  each  differentiable  valence  structure  for  both 
the  transitive  and  intransitive  verb  forms  as  constituting 
different  senses,  although  we  presume  that  the  same 
conceptual  structure  is  in  all  of  these  examples. 

Yet  other  senses  of  “open”  have  the  meaning  of  caus¬ 
ing  an  information-containing  item  to  come  into 
existence  (e.g.,  “open  a  bank  account”  or  “open  a  file 
on  someone”).  This  second  meaning  is  probably  based 
on  the  first  one.  Also,  the  various  adjective  uses  (e.g., 
“the  open  door”)  are  separate  senses  in  this  view  hav¬ 
ing  some  as  yet  unspecified  relation  to  the  senses 
described  above.  Finally,  other  words,  e.g.,  “close”, 
have  senses  that  we  presume  to  be  related  to  the  various 
senses  of  “open”  just  discussed. 


2.2.  MIDAS 

Previously,  we  have  succeeded  in  doing  some  automatic 
lexical  acquisition  by  exploiting  conventional  metaphors 
as  motivations  for  linguistic  forms.  In  particular,  Martin 
(1988)  implemented  the  MIDAS  system  which  both  uses 
metaphoric  word  senses  to  help  with  language  under- 
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standing,  and  to  extend  the  lexicon  when  a  new  meta¬ 
phoric  use  of  a  word  is  encountered.  For  example,  the 
sentence  “John  have  Mary  a  cold.”  is  presumed  to 
make  recourse  to  a  “a  cold  is  a  possession**  metaphor. 
We  call  such  a  conventionalized  metaphor  a  core  meta- 
frfior,  since  it  seems  to  serve  as  the  basis  for  related 
metaphoric  uses.  Thus,  the  sentence  “John  gave  Mary  a 
cdd’*  is  presumed  to  involve  the  “infecting  with  a  cold 
is  giving  the  cold**  metaphor,  which  entails  the  previous 
‘*cold  is  possession**  metaphor. 

Suppose  the  system  encounters  an  utterance  like  *‘John 
got  the  flu  from  Mary**,  but  is  not  familiar  with  this  use 
of  the  verb  “get**,  nor  with  the  notion  of  a  flu  being 
treated  as  a  possession.  Then  both  the  available  non- 
metaphoric  sense  of  “get**,  along  with  the  metaphors 
involving  diseases  and  possession,  are  brought  to  bear  to 
hypothesize  the  word  sense  that  might  be  in  play. 
Hypotheses  are  generated  by  two  kinds  of  lexical  exten¬ 
sion  processes;  core  extension  and  similarity  extension. 
Understanding  *‘get  a  cold**  given  an  understanding  of 
“give  a  cold**  involves  core  extension,  as  the  core  meta- 
irfior  “cold  is  possession**  is  extended  to  the  “getting** 
concept;  understanding  “get  the  flu**  given  an  undo'- 
standing  of  “get  a  cold**  involves  similarity  extension, 
as  the  generalization  about  a  role  in  the  metaphoric 
structure  must  be  extended  from  colds  to  diseases  in 
general.  Understanding  *‘get  the  flu**  given  an  under¬ 
standing  of  “give  a  cold**  involves  both  kinds  of  exten¬ 
sion. 

The  MIDAS  system  has  been  used  in  conjunction  with 
UC  to  extend  metaphoric  word  senses  in  the  computer 
domain.  The  following  is  an  example  of  MIDAS  learn¬ 
ing  a  new  sense  of  the  word  “Idll**,  given  that  it  knows 
some  metaphoric  extensions  of  this  sense  outside  the 
computer  domain. 


#  How  can  I  kill  a  process? 

No  valid  interpretations.  Atten^Jting 
to  extend  existing  metaphor. 

Searching  for  related  known  metaphors. 

Metaphors  found:  Kill-Conversation 

Kill-Delete-Line  Kill-Sports-Defeat 

Selecting  metaphor  Kill-Conversation 
to  extend  from. 

Attempting  a  similarity  extension 
inference . 

Extending  similar  metaphor  Kill- 
Conversation  with  target  concept 
Terminate-Conversation . 


Abstracting  Terminate-Conversation  to 
ancestor  concept 

Creating  new  metaphor: 

Mapping  main  source  concept  Killing 
to  main  target  concept 

Terminate-Conputer-Process 

Mapping  source  role  killer  to  target 
role  c-proc-termer. 

Mapping  source  role  kill-victim  to 
target  role  c-proc-termed. 

Calling  UC: 

You  can  kill  a  computer  process  by 
typing  “c  to  the  shell. 


Here  MIDAS  first  retrieves  a  number  of  metaphors 
related  to  the  input;  of  these,  “Kill-Convosation**  is 
chosen  as  most  applicable.  A  simple  similarity  exten¬ 
sion  is  attempted,  resulting  in  a  proposed  “Terminaie- 
Computer-Process**  metaphor  for  interpretation  of  the 
input.  The  interpretation  thus  provided  is  passed  along 
to  UC,  which  can  answer  this  question.  Meanwhile,  the 
metaphor  is  incorporated  into  UC*s  knowledge  base, 
which  allows  UC*s  language  generates  to  use  the  same 
terminology  in  encoding  the  answer. 

MIDAS  is  discussed  in  detail  in  Martin  (1988). 


3.  Why  MIDAS  Works 

We  believe  that  MIDAS  works  because  it  is  exploiting 
met^horic  subregularity  by  a  form  of  analogical  rea¬ 
soning.  That  is,  it  finds  a  metaphorical  usage  that  is 
closest  to  the  given  case  according  to  some  conceptual 
metric;  it  then  exploits  the  structure  of  the  prior  meta¬ 
phor  usage  to  construct  an  analogous  one  for  the  case  at 
hand,  and  proposes  this  new  structure  as  a  hypothetical 
word  sense.  Note  that  according  to  this  explanation, 
metaphor  does  not  play  a  crucial  role  in  the  extension 
process.  Rather,  it  is  the  fact  that  the  metapho’  is  a 
subregularity  rather  than  the  fact  that  it  is  a  metaphor 
that  makes  it  amenable  to  analogical  exploitation. 

Analogy,  of  course,  has  played  a  prominent  role  in  tradi¬ 
tional  linguistics.  Indeed,  rather  influential  linguists  (for 
example,  Paul  (1891)  and  Bloomfield  (1933)  seemed  to 
attribute  all  novel  language  use  to  analogy.  However, 
today,  analogy  seems  almost  entirely  relegated  to 
diachronic  processses.  A  notable  exception  to  this  trend 
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is  the  work  of  Skousen  (in  press),  who  appears  to  advo¬ 
cate  a  view  quite  similar  to  our  own,  although  die  pri¬ 
mary  focus  of  his  work  is  morphological. 

Analogy  has  also  been  widely  studied  in  artificial  intelli¬ 
gence  and  cognitive  psychology.  The  woik  of  Carbonell 
(1982)  and  Burstein  (1983)  is  most  relevant  to  our  enter¬ 
prise,  as  it  explores  the  role  of  analogy  in  knowledge 
acquisition.  Similarly,  Alterman's  (1985,  1988) 

approach  to  planning  shares  some  of  the  same  concerns. 
However,  many  of  the  details  of  Carbonell’s  and 
Alleiman’s  proposals  are  specific  to  problem  solving, 
and  Burstein’s  work  is  focused  on  formulating  con¬ 
straints  on  the  relations  to  be  considered  for  analogical 
mapping.  Thus,  their  work  does  not  appear  to  have  an 
obvious  application  to  our  problem.  Many  of  the  diHier- 
ences  between  analogical  reasoning  for  problem  solving 
and  language  knowledge  acquisition  are  discussed  at 
length  in  Martin  (1988). 

Another  line  of  related  wwk  is  the  connectionist 
approach  initiated  by  Rumelhart  and  McClelland  (1987), 
and  explicitly  considered  as  an  alterative  to  acquisition 
by  analogy  by  MacWhinney  et  al.  (1989).  However, 
there  are  numerous  reasons  we  believe  an  explicitly  ana¬ 
logical  framework  to  be  superior.  The  Rumelhart- 
McGelland  model  maintains  no  memory  of  specific 
cases,  but  only  a  statistical  summary  of  them.  Also,  the 
analogy-based  model  can  use  its  knowledge  mtve  flexi¬ 
bly,  for  example,  to  infer  that  a  word  encountered  is  the 
past  tense  of  a  known  word,  a  task  that  an  associationist 
networks  could  not  easily  be  made  to  perform.  In  addi¬ 
tion,  we  interpret  as  evidence  supportive  of  a  position 
like  ours  psycholinguistic  results  such  as  those  of  Cutler 
(1983)  and  Butterworth  (1983),  which  suggest  that 
words  are  represented  in  their  full  “undecomposed” 
form,  along  with  some  sorts  of  relations  between  related 
words. 


3.1.  Other  Kinds  of  Lexical  Subregularities 

If  MIDAS  works  by  applying  analogical  reasoning  to 
exploit  metaphoric  subregularities,  then  the  question 
arises  as  what  other  kinds  of  lexical  subregularities  there 
might  be.  One  set  of  candidates  is  approached  in  the 
work  of  Brugman  (1981,  1984)  and  Norvig  and  Lakoff 
(1987).  In  particular,  Norvig  and  Lakoff  (1987)  offer 
six  types  of  links  between  word  senses  in  what  they  call 
lexica/  network  theory.  However,  their  theory  is  con¬ 
cerned  only  with  senses  of  one  word.  Also,  there  appear 
to  be  many  more  links  than  these.  Indeed,  we  have  no 
reason  to  believe  that  the  number  of  such  subregularities 
is  bounded  in  principle. 

We  present  a  partial  list  of  some  of  the  subregularities 
we  have  encountered.  The  list  below  uses  a  rather  infor¬ 
mal  rule  format,  and  gives  a  couple  of  examples  of 


words  to  which  the  rule  is  iq^plicable.  It  is  hoped  that 
explicating  a  few  examples  telow  will  let  the  reader 
infer  the  meanings  of  some  of  the  others: 

(1)  function-object-noun  ->  primary-activity- 
‘  ‘determinerless’  ’-noun 

(“the  bed’’  ->  “in  bed,  go  to  bed”;  “a  school  ->  at 
school”;  “my  lunch  ->  at  lunch”-;  “the  conference  -> 
in  conference”) 

(2)  noun  ->  lesembling-in-appearance-noun 

(“tree”  ->  “(rose)  tree”;  “tree”  ->  “(shoe)  tree”); 
“tiger”  ->  “(stuffed)  tiger”,  “pencil”  ->  “pencil  (of 
light)”) 

(3)  noun  ->  having-the-same-function-noun 
(“bed”  ->  “bed  (of  leaves)”) 

(4)  noun  ->  “involve-concretion”-verb 

(“a  tree”  ->  “to  tree  (a  cat)”;  “a  knife”  ->  “to  knife 
(someone)”) 

(5)  verb  ->  verb-w-role-splitting 

(“take  a  book”  ->  “take  a  book  to  Mary”,  “John 
shaved”  ->  “John  shaved  Bill”) 

(6)  verb  ->  profiled-component-verb 

(“take  a  book”  ->  “take  a  book  to  the  Cape”) 

(7)  verb  ->  frame-imposition-verb 

(“take  a  book”  ->  “take  someone  to  dinner”,  “go”  -> 
“go  dancing”) 

(8)  activity-verb-t  ->  concretion-activity-verb-i 

(“eat  an  ^le”  ->  “cat  [a  meal]”,  “drink  a  c(*e”  -> 
“drink  [alcohol]”,  “feed  the  dog”  ->  “the  dog  feeds”) 

(9)  activity-verb-t  ->  dobj-subj-middle-voice-verb-i 
(“drive  a  car”  ->  “the  car  drives  well”) 

(10)  activity-verb-i  ->  activity-verb-t-primary-category 
(“John  dreamed”  ->  “John  dreamed  a  dre^”;  “John 
slq)t”  ->  “John  slept  the  sleep  of  the  innocent”) 

(11)  activity-verb-i  ->  do-cause-activity-veib-t(patient 
as  subject) 

(“John  slept”  ->  “The  bed  sleeps  five”) 

(12)  activity-verb  ->  activity-of-noun 

(“to  cry”  ->  “a  cry  (in  the  wilderness)”;  “to  punch” 
->  “a  punch  (in  the  mouth)”) 

(13)  activity-verb  <->  product-of-activity-noun 
(“copy  the  paper”  <->  “a  copy  of  the  paper”;  xerox, 
telegram,  telegrairfi) 
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(14)  functional-noun  ->  use-function-veib 

(“the  telephone”  ->  “telephcxie  John”;  machine, 
motoTcycle,  telegn^h) 

(15)  object-noun  ->  central-component-of-object 

(“a  bed”  ->  “bought  a  bed  [sframe  with  not  mattress]; 
“an  apple”  ->  “eat  an  ^le  [=without  the  core]”)) 

C^onsider  the  first  rule.  This  rule  states  that,  for  some 
noun  whose  core  meaning  is  a  functional  object,  there  is 
another  sense,  also  a  noun,  that  occurs  without  determi¬ 
nation,  and  means  the  primary  activity  associated  with 
the  first  sense.  For  example,  the  word  “bed”  has  as  a 
core  meaning  a  functional  object  used  for  sleeping. 
However,  the  wixd  can  also  be  used  in  utterances  like 
“go  to  bed”  and  “before  bed”  (but  not,  say,  “*during 
bed”).  In  these  cases,  the  noun  is  determinerless,  and 
means  something  akin  to  sleeping.  Other  examples 
iiKlude  “jail”,  “conference”,  “school”  and  virtually 
all  the  meal  trams,  e.g.,  “lunch”,  “tea”,  “dinner”. 
British  English  allows  “in  hospital”,  while  American 
English  does  not 

The  dialect  difference  undrascores  the  point  that  this  is 
truly  a  subregularity:  concepts  that  might  be  expressed 
this  way  are  not  necessarily  expressed  this  way.  Also, 
we  chose  this  example  not  because  it  in  itself  is  a  partic¬ 
ularly  important  generalization  about  English,  but  pre¬ 
cisely  because  it  is  not.  That  is,  there  appear  to  be  many 
such  facts  of  limited  scope,  and  each  of  them  may  be 
useful  for  learning  analogous  cases. 

Consider  also  rule  4,  which  relates  function  nouns  to 
verbs.  Examples  of  this  are  “tree”  as  in  “The  dog 
treed  the  cat”  and  “knife”  as  in  “The  murderer  knifed 
his  victim”.  The  applicable  rule  states  that  the  verb 
means  some  specific  activity  involving  the  core  meaning 
of  the  noun.  I.e.,  the  verbs  are  treated  as  a  sort  of  con¬ 
ventionalized  denominalization.  Note  that  the  activity  is 
presumed  to  be  specific,  and  that  the  way  in  which  it 
must  be  “concreted”  is  assumed  to  be  {nagmatically 
determined.  Thus,  the  rule  can  only  tell  us  that  “tree¬ 
ing”  involves  a  tree,  but  only  our  world  knowledge 
might  suggest  to  us  that  it  involves  cornering;  similarly, 
the  rule  can  tell  us  that  “knifing”  involves  the  use  of  a 
knife,  but  cannot  tell  us  that  it  means  stabbing  a  person, 
and  not  say,  just  cutting. 

As  a  final  illustration,  consider  rule  5,  so-called  “role 
q)iitting”  (this  is  the  same  as  Norvig  and  LakofTs 
semantic  role  differentiation  link).  This  rule  suggests 
that,  given  a  verb  in  which  two  thematic  roles  are  real¬ 
ized  by  a  single  complement  may  have  another  sense  in 
which  these  two  complements  are  realized  sq>arately. 
For  example,  in  “John  took  a  book  from  Mary”,  John  is 
both  the  recipient  and  the  agent  However,  in  “John 
took  a  book  to  Mary”,  John  is  only  the  agent  and  Mary 
is  the  recipient  Thus,  the  sense  of  “take”  involved  in 


the  first  sentence,  which  we  suggest  corresponds  to  a 
core  meaning,  is  the  basis  for  the  sense  used  in  the 
second,  in  which  the  roles  coinciding  in  the  first  are  real¬ 
ized  separately.  A  similar  prediction  might  be  made 
from  an  intransitive  verb  like  “shave”,  in  which  agent 
and  patient  coincide,  to  the  existence  of  a  transitive  verb 
“shave”  in  which  the  patient  is  realized  separately  as 
the  direct  object  (Of  course,  the  tendency  of  patients  to 
get  realized  as  direct  objects  in  English  should  also  help 
motivate  this  fact,  and  can  presumably  also  be  exploited 
analogically.) 


4.  An  Analogy-based  Model  of  Lexical  Acquisition 

We  have  been  attempting  to  extend  MIDAS-style  word 
hypothesizing  to  be  ^le  to  propose  new  word  senses  by 
using  analogy  to  exploit  these  other  kinds  of  lexic^ 
subregularities.  At  this  point,  our  work  has  been  rather 
preliminary,  but  we  can  at  least  sketch  out  the  basic 
architecture  of  our  proposal  and  comment  on  the  prob¬ 
lems  we  have  yet  to  resolve. 

(A)  Detect  unknown  wrad  sense.  For  example,  suppose 
the  system  encountered  the  following  i^irase: 

“at  breakfast” 

Suppose  further  that  the  function  noun  “breakfa.«t” 
were  known  to  the  system,  but  the  determinerless  usage 
were  not.  In  this  case,  the  system  would  hypothesize 
that  it  is  lacking  a  word  sense  because  of  a  failure  to 
parse  the  sentence. 

(B)  Find  relevant  cases/subregularities.  Cues  from  the 
input  would  be  used  to  suggest  prior  relevant  lexical 
knowledge.  In  our  example,  the  retrieved  cases  might 
itKlude  the  following: 

bed-l/bed-3,  class- l/class-4 

Here  we  have  numbered  word  senses  so  that  the  first 
element  of  each  pair  designates  a  sense  involving  a  core 
meaning,  and  the  latter  a  determineless-acdvity  type  of 
sense.  We  may  have  also  already  computed  and  stored 
relevant  subregularities.  If  so,  then  these  would  be 
retrieved  as  well. 

Relevant  issues  here  are  the  indexing  and  retrieval  of 
cases  and  subregularities.  Our  assumption  is  that  we  can 
retrieve  relevant  cases  by  a  conjunction  of  simple  cues, 
like  “noun”,  “function^  meaning”,  “extended  detra- 
minraless  noun  sense”,  etc.,  and  then  rely  on  the  next 
phase  to  discriminate  further  among  these. 

(C)  Chose  the  most  pertinent  case  or  subregularity. 
Again,  by  analogy  to  ^DAS,  some  distance  metric  is 
used  to  pick  the  best  datum  to  analogize  from.  In  this 


368 


case,  peihaps  the  ccMrect  choice  would  be  the  following: 

class- l/class-4 

One  motivation  for  this  selection  is  that  “class"  is  com¬ 
patible  with  “at”,  as  is  the  case  in  point 

Finding  the  right  metric  is  the  primary  issue  here.  The 
MIDAS  metric  is  a  simple  sum  of  two  factors:  (i)  the 
length  of  the  core-relationship  from  the  input  source  to 
the  source  of  the  candidate  metaphor,  and  (ii)  hierarchi¬ 
cal  distance  between  the  two  concepts.  Both  factors  are 
measured  by  the  number  of  links  in  the  rqnesentation 
that  must  be  traversed  to  get  from  one  concq}t  to  the 
other.  The  hierarchical  distance  factor  of  the  MIDAS 
metric  seems  directly  relevant  to  other  cases.  However, 
there  is  no  obvious  counterpart  to  the  core-relationship 
component  One  possible  reason  for  this  is  that  meta¬ 
phoric  extensions  are  more  complex  than  most  other 
kinds;  if  so,  then  the  MIDAS  metric  may  still  be  applica¬ 
ble  to  the  other  subregularities,  which  are  just  simpler 
special  cases. 

(D)  Analogize  to  a  new  meaning.  Given  the  best  case  or 
subregularity,  the  system  will  attempt  to  hypothesize  a 
new  word  sense.  For  example,  in  the  case  at  hand,  we 
would  like  a  rqrresentation  for  the  meaning  in  quotes  to 
be  produced. 

class- l/class4 :: 

breakfast- l/“period  of  eating  breakfast” 

In  the  case  of  MIDAS,  the  meuphoric  structure  of  pre¬ 
vious  examples  was  assumed  to  be  available.  Then, 
once  a  best  match  was  established,  it  is  relatively 
straightforward  to  generalize  or  extend  this  structure  to 
apply  to  the  new  input  The  same  would  be  true  in  the 
general  case,  jx'ovided  that  the  relation  between  stored 
polysemous  wo’d  senses  is  readily  available. 

(E)  Determine  the  extent  of  generalization.  Supposing 
that  a  single  new  word  sense  can  be  successfully  pro¬ 
posed,  the  question  arises  as  to  whether  just  this  particu¬ 
lar  word  sense  is  all  the  system  can  hypothesize,  or 
whetho-  some  “local  productivity”  is  possible.  For 
example,  if  this  is  the  first  meal  term  the  system  has  seen 
as  having  a  determinerless  activity  sense,  we  suspect 
that  only  the  single  sense  should  be  generated.  How¬ 
ever.  if  it  is  the  second  such  meal  term,  then  the  first  one 
would  have  been  the  likely  basis  for  the  analogy,  and  a 
generalization  to  meal  terms  in  general  may  be 
attempted. 

(F)  Record  a  new  entry.  The  new  sense  needs  to  be 
stored  in  the  lexicon,  and  indexed  for  further  reference. 
This  task  may  interact  closely  with  (E),  although  gen¬ 
eralizing  to  unattested  cases  and  computing  explicit 
subregularities  are  logically  independent 


There  are  many  additional  problems  to  be  addressed 
beyond  the  ones  alluded  to  atove.  In  particular,  there  is 
the  issue  of  the  role  of  world  knowledge  in  the  proposed 
process.  In  the  example  above,  the  system  must  know 
that  the  activity  of  eating  is  the  {nimary  one  associated 
with  breakfast  A  mote  dramatic  example  is  the  role  of 
world  knowledge  in  hypothesizing  the  meaning  of 
“treed”  in  expressions  like  “the..dog  treed  the  cat”, 
assuming  that  the  system  is  acquainted  with  the  noun 
“tree”.  All  an  analogical  reasoning  mechanism  can  do 
is  suggest  that  some  specific  activity  associated  with 
trees  is  involved;  the  application  of  worid  knowledge 
would  have  to  do  the  rest 


5.  Other  Directions  of  Investigation 

We  have  also  been  investigating  exploiting  sulxegulari- 
ties  in  “intelligent  dictionary  reading”.  This  project 
involves  an  additional  idea,  namely,  that  one  could  best 
use  a  dictionary  to  gain  lexical  knowledge  by  bringing  to 
bear  on  it  a  fidl  natural  language  processing  capability. 
One  problem  we  have  encountered  is  that  dictionaries 
are  full  of  inaccuracies  about  the  meaning  of  wo'ds.  For 
example,  even  relatively  good  dictionaries  have  poor 
entries  for  the  likes  of  determinerless  nouns  like  “b^”. 
E.g.,  Webster’s  New  World  (Second  Edition)  simply 
lists  “bedtime”  as  a  sense  of  “bed”;  Longman’s  Dic¬ 
tionary  of  Contemporary  English  (New  Edition)  uses 
“in  b^”  as  an  example  of  the  wdinary  noun  “bed”, 
then  explicitly  lists  the  phrase  “time  for  bed”  as  mean¬ 
ing  “time  to  go  to  sleep”,  and  gives  a  few  other  deter¬ 
minerless  usages,  leaving  it  to  the  reader  to  infer  a  gen¬ 
eralization.*  However,  a  dictionary  reader  with 
knowledge  o{  the  subregularity  mentioned  above  might 
be  able  to  correct  such  deficiencies,  and  come  up  with  a 
better  meaning  that  the  one  the  dictionary  supplies. 
Thus,  we  plan  to  explore  augmenting  our  intelligent  dic¬ 
tionary  reader  with  the  ability  to  use  subregularities  to 
compensate  for  inadequate  dictionary  entries. 

We  are  also  attempting  to  q>ply  the  same  approach  to 
acquiring  the  semantics  of  constructions.  In  particular, 
we  are  investigating  verb-particular  combinations  and 
conventionalized  tKxin  phrases  (e.g.,  nominal  com¬ 
pounds).  We  are  also  looking  at  cmtstructions  like  the 
ditransitive  (i.e.,  dative  alternation),  which  seem  also  to 
display  a  land  of  polysemy.  Specifically,  Goldberg 
(1989,  1990)  has  argued  that  much  of  the  data  on  this 
construction  can  be  accounted  for  in  terms  of  subclasses 
that  are  conventionally  associated  with  the  construction 
itself,  rather  than  with  lexical  rules  and  transformations 
as  proposed,  for  example,  by  Gropen  et  al.  (1989).  If 
so,  then  the  techniques  for  the  acquisition  of  polysemous 

*Lofiginan’t  alto  defines  "make  the  bed”  u  “make  it  ready  for 
sleeping  in”.  We  have  no  idea  bow  to  oope  with  such  errors,  but 
they  do  underscore  the  problem. 
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lexical  items  should  prove  equally  applicable  to  the 
acquisition  of  knowlet^e  about  such  constructions.  We 
are  attempting  to  determine  whetho-  this  is  the  case. 
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